Cognitive Speech Coding
نویسندگان
چکیده
Speech coding is a field where compression paradigms have not changed in the last 30 years. The speech signals are most commonly encoded with compression methods that have roots in Linear Predictive theory dating back to the early 1940s. This paper tries to bridge this influential theory with recent cognitive studies applicable in speech communication engineering. This tutorial article reviews the mechanisms of speech perception that lead to perceptual speech coding. Then it focuses on human speech communication and machine learning, and application of cognitive speech processing in speech compression that presents a paradigm shift from perceptual (auditory) speech processing towards cognitive (auditory plus cortical) speech processing. The objective of this tutorial is to provide an overview of the impact of cognitive speech processing on speech compression and discuss challenges faced in this interdisciplinary speech processing field. In this context, it covers the traditional speech coding techniques as well as emerging approaches facilitated by deep learning computational methods. The tutorial points out key references on fundamental teachings of psycholinguistics and speech neuroscience and provides a valuable background to beginners and practitioners on the promising directions of incorporating principles of cognitive speech processing in speech compression.
منابع مشابه
Statistical parametric speech synthesis with a novel codebook-based excitation model
Speech synthesis is an important modality in Cognitive Infocommunications, which is the intersection of informatics and cognitive sciences. Statistical parametric methods have gained importance in speech synthesis recently. The speech signal is decomposed to parameters and later restored from them. The decomposition is implemented by speech coders. We apply a novel codebook-based speech coding ...
متن کاملThe Role of L2 Private Speech in Cognitive Regulation of Adult Foreign Language Learners
The present study investigated the use of L2 private speech by English foreign language (EFL) learners in regulating their mental activities. Thirty intermediate adult EFL learners took a test of solving challenging English riddles while their voices were being recorded. Following, instances of the produced private speech were analyzed in terms of form, content, and function. Numerous instances...
متن کاملEffects of sound pillow in the treatment of stuttering and cognitive phonemes impairment in children
Introduction:Verbal language is Fundamental component for expressing ideas, social interaction and understanding educational materials. Effective communications require verbal language skills. Sound pillows may partly address the children with behavior problems. The purpose of this study was assessing the effect of educational sound pillow in the treatment of stuttering and cognitive phonemes i...
متن کاملAchievable Secrecy Rate Regions of State Dependent Causal Cognitive Interference Channel
In this paper, the secrecy problem in the state dependent causal cognitive interference channel is studied. The channel state is non-causally known at the cognitive encoder. The message of the cognitive encoder must be kept secret from the primary receiver. We use a coding scheme which is a combination of compress-and-forward strategy with Marton coding, Gel’fand-Pinsker coding and Wyner’s wire...
متن کامل1 1 2 3 A mutual information analysis of neural coding of speech by low 4 frequency MEG phase information 5
3 A mutual information analysis of neural coding of speech by low 4 frequency MEG phase information 5 Gregory B. Cogan & David Poeppel 6 7 1 Neuroscience and Cognitive Science, University of Maryland College Park 8 2 Department of Psychology, NYU 9 3 Center for Neural Science, NYU 10 11 Running Head: Mutual Information and MEG Phase 12 13 Address for Correspondence 14 Gregory B. Cogan 15 Depart...
متن کامل